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Abstract 

Analog-to-digital conversion (ADC) is a key bottleneck in scaling DSP-centric receiver architectures 
to multiGigabit/s speeds. Recent information-theoretic results, obtained under ideal channel conditions 
f^) ' (perfect synchronization, no dispersion), indicate that low -precision ADC (1-4 bits) could be a suitable 

choice for designing such high speed systems. In this work, we study the impact of employing low- 
fl^ . precision ADC in a carrier asynchronous system. Specifically, we consider transmission over the 

block noncoherent Additive White Gaussian Noise (AWGN) channel, and investigate the achievable 

O " 

£N| \ performance under low-precision output quantization. We focus attention on an architecture in which the 

receiver quantizes only the phase of the received signal: this has the advantage of being implementable 
without automatic gain control, using multiple 1-bit ADCs preceded by analog multipliers. For standard 
uniform Phase Shift Keying (PSK) modulation, we study the structure of the transition density of the 
resulting phase-quantized block noncoherent channel. Several results, based on the symmetry inherent in 
the channel model, are provided to characterize this transition density. Low-complexity procedures for 
computing the channel capacity, and for block demodulation, are obtained using these results. Numerical 
computations are performed to compare the performance of quantized and unquantized systems, for 
| different quantization precisions, and different block lengths. It is observed, for example, that with 

QPSK modulation, 8-bin phase quantization of the received signal recovers about 80-85% of the capacity 
attained with unquantized observations, while 12-bin phase quantization recovers more than 90% of the 
unquantized capacity. Dithering the constellation is shown to improve the performance in the face of 
^ ' drastic quantization. 

??; 

I. Introduction 

The economies of scale provided by integrated circuit implementation of sophisticated digital 
signal processing (DSP) algorithms have propelled mass market deployment of cellular and 
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wireless local area network systems over the last two decades. As we now look to scale up the 
speeds of such DSP-centric architectures by orders of magnitude (e.g., to build multigigabit/sec 
systems by exploiting the wide swath of spectrum in the 60 GHz band [Q]|), the analog-to- 
digital converter (ADC), which converts the received analog waveform into the digital domain, 
becomes a bottleneck: high-speed high-precision ADC is costly and power-hungry J2]|, 0. It 
is of interest, therefore, to explore the feasibility of system design with low-precision ADC. 
Recent information-theoretic results flU, show that for an ideal channel model (perfect 
synchronization, no dispersion), the loss in the Shannon capacity incurred by using 1-3 bits 
ADC is very acceptable, even at moderately high signal-to-noise ratio (SNR). In this paper, we 
consider a system without a priori carrier synchronization, i.e., there is a small frequency offset 
between the receiver's local oscillator and the incoming carrier wave, and investigate the impact 
of low-precision ADC on the achievable performance. 

To model the effect of carrier asynchronism, we consider a discrete-time complex baseband 
block noncoherent additive white Gaussian noise (AWGN) channel: if the receiver's local oscil- 
lator is not synchronized with that of the transmitter, the phase after downconversion is a priori 
unknown, but, for practical values of carrier offset, well approximated as constant over a block 
of symbols. The classical approach to noncoherent communication is to approximate the phase 
as constant over two symbols, and to apply differential modulation and demodulation. Divsalar 
and Simon [0 were the first to point out the gains that may be achieved by performing multiple 
symbol differential demodulation over a block of L > 2 symbols, under the assumption that the 
phase remains constant over the entire block (L is termed the channel coherence length). More 
recent work 0, (HI, [0 has shown that block demodulation, even for large L, can be implemented 
efficiently, and exhibits excellent performance for both coded and uncoded systems. 

We study the effect of low-precision receiver quantization for the block noncoherent channel, 
under Af-ary Phase Shift Keying (MPSK) modulation. Since PSK encodes information in the 
phase of the transmitted symbols, and since the channel impairment is also being modeled as a 
phase rotation, we investigate an architecture in which the receiver quantizes only the phase of 
the received samples, disregarding the amplitude information. Such phase-only quantization is 
attractive because of its ease of implementation: it eliminates the need for automatic gain control 
(since no amplitude information is used), and can be performed using only 1-bit ADCs preceded 
by analog pre-multipliers. For example, 1-bit ADC on both I and Q channels implements uniform 
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4-sector phase quantization. Uniform 8-sector quantization may simply be achieved by adding 
two new linear combinations, I+Q and I-Q (no analog pre-multipliers needed in this case), 
corresponding to a | rotation of the I/Q axis (see Fig. [[])• 

Our focus here is on computing the capacity and the uncoded error rates for communication 
over the phase-quantized block noncoherent channel. While brute force computation of these 
quantities has complexity that scales exponentially in the coherence length of the channel, we 
find that significant complexity reduction can be attained by understanding the nature of the 
channel block transition probability. A summary of our analytical and numerical results, for 
M-PSK modulation and uniform A'-sector phase quantization, is as follows: 

1) We begin by studying the structure of the input-output relationship of the phase quantized 
block noncoherent AWGN channel. For the special case when M divides K, we exploit 
the symmetry inherent in the channel model to derive several results characterizing the 
output probability distribution over a block of symbols, both conditioned on the input, 
and without conditioning. These results are used to obtain a low-complexity procedure for 
computing the capacity of the channel (brute force computation has complexity exponential 
in the block length L). 

2) We obtain low-complexity optimal block noncoherent demodulation rules. These rules are 
obtained by specializing the existing low-complexity procedures for block demodulation 
with unquantized observations, to our setting with quantized observations. 

3) A close analysis of the block demodulator reveals that, depending on the number of 
quantization sectors, the symmetries inherent in the channel model, while helping us 
reduce the computational complexities, can also have a dire consequence: they can make 
it impossible to distinguish between the effect of the unknown phase offset and the 
phase modulation. As a result, we may have two equally likely inputs for certain outputs, 
irrespective of the block length and the SNR, leading to severe performance degradation. In 
order to break the undesirable symmetries, we investigate the performance with a dithered- 
PSK input scheme, in which we rotate the PSK constellation across the different symbols 
in a block. 

4) Numerical results are obtained for QPSK input with 8 and 12 sector phase quantization, for 
different choices of the block length L, and compared with the unquantized performance 
(studied earlier in ifTOl as well). We find that 8-sector quantization, with a dithered-QPSK 
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input, achieves more than 80-85 % of the capacity achieved with unquantized observations 
(with an identical block length), while with 12-sector quantization, and no dithering, we 
can get as much as 90-95 % of the unquantized capacity. The corresponding loss in terms of 
SNR, for fixed capacity, varies between 2 - 4 dB for 8-sector quantization, and between 0.5 
- 2 dB with 12 sectors. In terms of the uncoded symbol error rates (SER), the performance 
degradation is of the same order. For instance, at SER = 10~ 3 , the loss for 8 and 12 sector 
quantization, compared to unquantized observations, is about 4 dB and 2 dB respectively. 
Related work: Two recent works, that are most closely related to our work are |[TTT|. [fT2|. Both 
these works consider a phase offset between the transmitter and the receiver, and investigate the 
impact of output quantization on the achievable performance. In ifTTTl . the authors study a mixed- 
signal receiver architecture, wherein the unknown phase offset is first estimated in post- ADC DSP 
(in a data aided, or, a non data aided manner), and then compensated for in the analog domain 
prior to the ADC. In [fT2l . the authors do not resort to analog domain compensation (e.g., due 
to hardware complexity constraints), but still use the knowledge of the phase offset (assumed to 
be somehow available) in the post-ADC processing. They observe that, in such a system, output 
(amplitude) quantization can cause significant degradation in the achievable transmission rate. 
Note that while both these works focus on explicit estimation/knowledge and use of the phase 
offset, the block noncoherent receiver studied here, rather amounts to an implicit joint estimation 
of the unknown phase offset and the unknown data symbols. Besides these works, and our own 
results reported earlier in lfl3l . [fT4|. we are not aware of any other work that investigates the 
impact of output quantization in a carrier asynchronous receiver. 

In this work, we consider only a phase offset (induced by the asynchronous local oscillators) 
between the transmitter and the receiver. However, the block noncoherent model extends to the 
setting of communication over narrowband slow fading channels as well, where the channel state, 
although unknown, can be assumed to be constant over a block of symbols. Indeed, this block 
fading model has been investigated extensively in the recent literature, ranging from capacity 
analysis [15J, to efficient architectures for block demodulation and decoding (0, |0O, flU, iTToTl . 
and references therein). While our work draws upon this extensive literature, it addresses, for 
the first time, the unique channel characteristics that arise due to (drastic) output quantization 
of the block noncoherent channel. 

Our prior information-theoretic studies to understand the impact of output quantization on the 
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capacity of the ideal AWGN channel include IfTTIh 0, BJ. More recent work that explores the 
impact of quantization on other aspects of communication system design includes, amongst oth- 
ers, the study of oversampling |fT8l , Ifl9ll , non-symmetric quantization ||20|| , automatic gain control 
lt2TTl . channel estimation performance E2l . performance over fading channels Il23l , broadcast and 
MAC channels ll24ll . and channels with memory [|25l . constellation shaping methods ll26ll . and 
study of system performance under the generalized mutual information criterion ll27l . 
Organization of the Paper: The rest of the paper is organized as follows. In Section HH we 
describe the channel model and the receiver architecture for phase quantization. In Section |inj 
we study the properties of the channel's transition probability distribution. Efficient mechanisms 
for computing the channel capacity and for block noncoherent demodulation are described in 
Sections |IV] and |Vj respectively. Numerical results are presented in Section |VT1 followed by the 
conclusions and a list of open issues in Section IVIIl 

Notation: Throughout the paper, we denote random variables by capital letters, and the specific 
value they take using small letters. Bold faced notation is used to denote vectors of random 
variables. E is the expectation operator. 



II. Channel Model and Receiver Architecture 
The received signal over a block of length L, after quantization is represented as 

Z t = Q(S ie ^ + N t ) , Z = 0,1,..- ,L-1, (1) 

where, 

• S := [Sq Si ■ ■ • Sl-i] is the transmitted vector, 

• $ is an unknown constant with uniform distribution on [0, 2ir), 

• N := [N ■ ■ ■ Nl-i] is a vector of i.i.d. complex Gaussian noise with variance a 2 = N /2 
in each dimension, 

• Q:C— > JC = {0, 1, ■ ■ ■ , K — 1} denotes a quantization function that maps each point in 
the complex plane to one of the K quantization indices, and 

• Z := [Zq Z\ ■ ■ ■ Zl-i] is the vector of quantized received symbols, so that each Zi £ /C. 
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Each Si is picked in an i.i.d. manner from a uniform M-PSK constellation denoted by the set 
of points A = {e je °, e jei , • ■ ■ , e^- 1 }, where 6 m = (9 m _ x + ff) Q, for m = 1, 2, • ■ • , M - 1. 

We now introduce the random vector X = [X X x • • • Xz,_i], with each Xj picked in an 
i.i.d. manner from a uniform distribution on the set {0, 1, • • • , M — 1}. Our channel model (OQ) 
can now equivalently be written as 

Z l = Q{e^e^ + N l ) , I = 0, 1, • • • , L - 1 , (2) 

with every output symbol Zi E {0, 1, • • • , K — 1} as before, and every input symbol X L E 
{0, 1, ■ ■ • , M — 1}. The set of all possible input vectors is denoted by X, while Z denotes the 
set of all possible output vectors. 

We consider A'-bin (or A-sector) phase quantization: our quantizer divides the interval [0, 2n) 
into K equal parts, and the quantization indices go from to K — 1 in the counter-clockwise 
direction. Fig. QJb) depicts the scenario for K=S. Thus, our quantization function is Q(c) = 
Larg(c)|(^)J, where c E C, and [p\ denotes the greatest integer less than or equal to p. Such 
phase quantization can be implemented using 1-bit ADCs preceded by analog multipliers which 
provide linear combinations of the / and Q channel samples. For instance, employing 1-bit 
ADC on / and Q channels results in uniform 4-sector phase quantization, while uniform 8- 
sector quantization can be achieved simply by adding two new linear combinations, I+Q and 
I-Q, corresponding to a n/A rotation of HQ axes (no analog multipliers needed in this case), 
as shown in Fig. dja). 

We begin our investigation by studying the inherent symmetry in the relationship between 
the channel input and output. This yields several results that govern the structure of the output 
probability distribution, both conditioned on the input (i.e., P(Z|X)), and without conditioning 
(i.e., P(Z)). These distributions are integral to computing the channel capacity (one of our focuses 
in this paper), as well as for soft decision decoding (not considered here). While brute force 
computation (computing P(z|x) for every z E Z and every x E X) of these distributions has 
exponential complexity in the block length, our results allow their computation with significant 
reduction in the complexity. 

'Unless stated otherwise, any arithmetic operations for phase angles are assumed to be performed modulo 2-k. For the output 
symbols Zi, the arithmetic is modulo K, while for the input symbols Xi (introduced immediately after in the text ), it is modulo 
M. 
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Fig. 1. Receiver architecture for 8-sector quantization. 



Note: Throughout the paper, we assume that the PSK constellation size M, and the number 
of quantization bins K, are such that K = aM for some positive integer a. We illustrate our 
results with the running example of QPSK with 8-sector quantization (so that a = 2), depicted 
in Fig. Eta) 

III. Input-Output Relationship 

Conditioned on the channel phase $, P(Z|X, $) is a product of individual symbol probabilities 
P(Zi\Xi, $). We therefore begin by analyzing the symmetries in the latter. 

A. Properties of P(Zi\X h $) 

We have that P(zi\xi,(f)) is the probability that aig{e^ ex i + ^ + iVj) belongs to the interval 
[jt-zi ^w{z\ + 1))- In other words, it is the probability that the complex Gaussian noise Ni takes 
the point e^ dx i +< ^ on the unit circle, to another point whose phase belongs to [j^zi j^(zi + 1)). 
Due to the circular symmetry of the complex Gaussian noise, this is the same as the probability 
that Ni takes the point e^ 6x ' + ^ + ^^ on the unit circle, to another point whose phase belongs to 
[jf{zi + i) j^{zi + 1 + i)), where i is an integer. We thus get our first two results. 

Property A-l: P{zi\x h 0) = P(z t + i\x h + i^). 

Property A-2: P(zi\xi, 4>) = P{zi + ia\xi + i, 4>). 

Note that 6 Xl+i = 9 Xl + jji = XI + ^(ia), which gives Property A-2. 

Property A-2 simply states that if we jump from one point in the M-PSK constellation to 
the next, then we must jump a = jt quantization sectors in order to keep the conditional 
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Fig. 2. QPSK with 8-sector quantization (i.e., M=4, K=8). a) depicts how the unknown channel phase cj> results in a rotation 
of the transmitted symbol (square : original constellation , circle : rotated constellation), (b) and (c) depict the circular symmetry 
induced in the conditional probability P(z\x,(f>) due to the circular symmetry of the complex Gaussian noise, (b) shows that 
increasing (f> by 2n/K = (7r/4) and z by 1 will keep the conditional probability unchanged, i.e., P(z — 3|a;,<j!>) = P(z = 
4\x, <f> + 2n/K). (c) shows that increasing x by 1 and z by 2 = (K/M) will keep the conditional probability unchanged, i.e., 
P(z = 2\x, </>) = P(z = 4\x + 1, </>). 

probability invariant. This is intuitive, since the separation between consecutive points in the 
input constellation is 2n/M, while each quantization sector covers an angle of 2%/K. For QPSK 
with K = 8, Fig. (2b) and |2tc) depict example scenarios for the two properties. 

If we put i = —xi in Property A-2, we get the following special case, which relates the 
conditioning on a general x\ to the conditioning on 0. 

Property A-3: P(zi\xi,4>) = P(zi — axi\0, 0). 

To motivate our final property, we consider our example of QPSK with K — 8. While we have 
8 distinct quantization sectors, if we look at Fig. 2(a), the orientation of these 8 sectors relative 
to the 4 constellation points (shown as squares) can be described by dividing the sectors into 2 
groups : {0, 2, 4, 6}, and {1, 3, 5, 7}. For instance, the positioning of the first sector (z = 0) w.r.t. 
x = is identical to the positioning of the third sector (z = 2) w.r.t. x — 1 (and similarly z = 4 
w.r.t x = 2, and z = 6 w.r.t x = 3). On the other hand, the positioning of the second sector 
(z — 1) w.r.t. x = is identical to the positioning of the fourth sector (z = 3) w.r.t. x = 1 (and 
similarly z = 5 w.r.t x — 2, and z = 7 w.r.t x = 3). In terms of the conditional probabilities, 
this implies, for example, that we will have P(zi = 7\xi = 3,0) = P(zi = l\xi = 0,0), and 
similarly, P(zi = 6\xi = 3,0) = P(zi = 0|x/ = 0,0). In general, we can relate the conditional 
probability of every odd z\ with that of z\ — 1, and similarly of every even zi with that of zi = 0, 
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with corresponding rotations of the symbol x\. For general values of K and M, the number of 
groups equals a = j^, and we can relate the probability of any zi with that of zi mod a. 

Property A-4: Let z\ = qia + 77, where qi is the quotient on dividing zi by a, and ri is the 
remainder, i.e, 77 = zi mod a. Then, P(zi\xi, <fi) = P(zi mod a\xi — qi, 4>). 

While this result follows directly from Property A-2 by putting i = —q t , it is an important 
special case, as it enables us to restrict attention to only the first a sectors (Zi E {0, 1, • • • , a— 1}), 
rather than having to work with all the K sectors. As detailed later, this leads to significant 
complexity reduction in capacity computation. 

We now use these properties to present results for P(Z|X). 

B. Properties o/P(Z|X) 

Property B-l: Let 1 denote the row vector with all entries as 1. Then P(z|x) = P(z + il|x). 

Proof: For a fixed x, increasing each zi by the same number i leaves the conditional probability 
unchanged, because the phase $ in the channel model © is uniformly distributed in [0, 2tt). A 
detailed proof follows. We have 



The second equality follows by the fact that the components of Z are independent conditioned 
on X and $. Property A-l gives the third equality. A change of variables, $ = $ + %^ gives 
the fourth equality (since $ is uniformly distributed on [0, 2tt), so is $), thereby completing the 



Remark 1: For the rest of the paper, we refer to the operation z — > z + il as constant addition. 

Next, we observe that the conditional probability remains invariant under an identical permu- 
tation of the components of the vectors z and x. 

Property B-2: Let II denote a permutation operation, and Ilx (Ilz) the vector obtained on 
permuting x (z) under this operation. Then, P(z|x) = P(Tlz|nx). 





,/=o 

X-l 





proof. 



DRAFT 



10 



Proof: As in the proof of Property 1, the idea is to condition on <3> and work with the 
symbol probabilities P(zi\x h <&). Consider P(z|x, $) = P(zi\x h and P(Ilz|nx, $) = 

[\.i=q P((nz)J(rix) / , $). Since multiplication is a commutative operation, we have P(z|x, $) = 
P(Ilz|nx, $). Taking expectation w.r.t. $ completes the proof. ■ 

The next two results extend properties A-3 and A-4. 

Property B-3: Define the input vector x = [0 • ■ -0]. Then, P(z|x) = P(z — ax|x ), where 
a = jj, and the subtraction is performed modulo K. 

Property B-4: Let zi = qia+ri, where qi is the quotient on dividing zi by a, and ri is the remain- 
der, i.e, ti = zi mod a. Define q = [q , ■ ■ ■ , q^-i], and, z mod a = [zq mod a ■ ■ ■ Zl-\ mod a]. 
Then P(z|x) = P(z mod a \ x — q). 

Proofs: The properties follow from A-3 and A-A respectively, by first noting that the vector 
probability P(z|x, $) is the product of the scalar ones, and then integrating over $ . ■ 

C. Properties of P(Z) 

We now consider the unconditional distribution P(z). The first result states that P(z) is 
invariant under constant addition. 
Property C-l: P(z) = P(z + il). 

Proof: Using Property B-l, this follows directly by taking expectation over X on both sides. 

■ 

We now extend Property B-2 along similar lines to show that P(z) is invariant under any 
permutation of z. 

Property C-2: P(z) = P(IIz). 

Proof: We have P(z) = ^ J2 xe x P ( z l x )- Usin S Property B-2, we get P(z) = ^ Y.* e x P (nz|nx). 
Since the permutation operation results in a one-to-one mapping (every unique choice of x G X 
results in a unique Ilx G X), we can rewrite the last equation as P(z) = 2~2^ex P(ri z l x ) = 
P(nz). ■ 

Our final result extends Property B-4. 

Property C-3: Let a — Then P(z) = P(z mod a). 

Proof: Using the same notation as in Property B-4, we have P(z|x) = P(z mod a | x — q) . 
Noting that the transformation x — > x — q is a one-to-one mapping, the proof follows on the 
same lines as the proof of Property C-2. m 

DRAFT 



1 1 



Example: For QPSK with K = 8 and L = 4, P(z = [5 7 2 4]) = P(z = [1 1 0]). 
We now apply these results for low-complexity capacity computation. 

IV. Capacity Computation 
We wish to compute the mutual information 

/(X; Z) = H(Z) - H(Z\X). 

We first discuss computation of the conditional entropy. 

A. Conditional Entropy 

We have H(Z\X) = #( Z I X ) P ( X )> where #( z l x ) = ~ J2z P ( z l x ) lo S P ( z l x ) is the 
entropy of the output when the input vector X takes on the specific value x. Our main result in 
this section is that if (Z|x) is constant Vx. 

Property D-l: H(Z\x) is a constant. 

Proof: We show that for any input vector x, H(Z\x) = if(Z|x ), where x = [0- --0] as 
defined before. We have 

if(Z|x) = - ^P(z|x)logP(z|x) 

z 

= — P(z — ax|x ) log P(z — ax|x ) , (3) 

z 

where the second equality follows from Property B-3. Noting that the transformation z -> z-ax 
is a one-to-one mapping, we can rewrite © as 



if (Z|x) = -J2 P ( z l x °) lo S P ( z l x o) = H ( Z \*°< 



(4) 



Thus, if(Z|X) = if(Z|x ), but brute force computation of if(Z|x ) still has exponential 
complexity: P(Z|x ) must be computed for each of the K L possible output vectors Z. However, 
we find that it suffices to compute P(Z|x ) for a much smaller set of Z vectors. 

Using Property B-2, we have P(z|x ) = P(Ilz|rixo). Since x = [0..0], any permutation of 
x gives back x . Hence, P(z|x ) = P(Ilz|x ). Combined with Property B-l, we thus get that 
it suffices to compute P(z|x ) for a set of vectors Sz in which no vector can be obtained from 
another by performing the joint operations of constant addition and permutation. While we do 
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not have a method to obtain the set Sz exactly, we exploit the preceding properties to obtain, 
and, restrict attention to, a sub-optimal set Sz 2 having cardinality C[K + L — 2, L — 1), which 
is still significantly lower than the exponential figure of K L . For instance, with K = 8, and 
L = {3,4,5,6,7}, the cardinality of Sz 2 is {36,120,330,792,1716}, whereas the exponential 
figure K L is {512,4096,32768,2.6 x 10 5 , 2.1 x 10 6 }. Details regarding the construction of the 
set Sz 2 are provided in Appendix [A] 

Once we have constructed the set Sz 2 , the probability P(z|x ) can be numerically computed 
for every vector z in this set. Computation of the entropy H(Z\x ) is then straightforward. See 
Appendix [A] for the details. 

B. Output Entropy 

The output entropy is H(Z) = —^2 Z P(z)logP(z). Brute force computation requires us to 
know P(z) Vz G Z, which clearly has exponential complexity. However, using Properties C-l, 
C-2 and C-3, we get that it is sufficient to compute P(z) for a set of vectors Sz in which no 
vector can be obtained from another one by performing the operations of constant addition and 
permutation, and also, the vector components G {0, 1, ■ ■ • , a — 1}. This is similar to the situation 
encountered earlier in the last subsection, except that the vector components there were allowed to 
be in {0, 1, • • • , K — 1}. To exploit this for further complexity reduction, we can begin by defining 
the set Z to be the set of vectors in which the vector components take values in{0,l,---,a — 1} 
only. Since P(z) = P(z mod a), a moment's thought reveals that each vector in Z has the same 
probability as a set of (^) L = M L distinct vectors in Z, and the sets corresponding to different 

vectors are disjoint. Thus H(Z) = - P(z) log P(z) = -M L P(z) log P(z). To obtain 

z z 
{P(z)} for z G Z, we can follow exactly the same procedure as described in the last subsection, 

with K being replaced by a. In particular, we need to compute P(z) only for C(a + L — 2, L — 1) 

vectors. 

Example: For QPSK with 8 sectors (so a = 2), the relevant vectors for block length 2 are 
[0 0] and [0 1]. 

Computation of P(Z): We now need to compute P(z) = P( z l x ) f° r eacn °f m e 

C(a + L — 2, L — 1) vectors. A brute force approach is to compute P(z|x) for each x, but again, 
has exponential complexity. We exploit the structure in z to reduce the number of vectors x for 
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which we need P(z|x). Specifically, we have that each z; t G {0, 1, • • • , a — 1}. Since there are 
only a different types of components in z, for block length L > a, some of the components in 
z will be repeated. For any x, we can then use Property B-2 to rearrange the components at 
those locations for which the components in z are identical, without changing the conditional 
probability. For instance, let z m = z n for some m,n. Then, P(z|x) = P(z|Ilx), where Ilx 
is obtained from x by rearranging the components at locations m and n. To sum up, we can 
restrict attention to a set of vectors Sx in which no vector can be obtained from another one 
by permutations between those locations for which the elements in z are identical. Construction 
of this set Sx, and the subsequent computation of P(Z), is discussed in Appendix |Bj For the 
example scenario of QPSK with 8-sector quantization (so that a = 2), and block length L = 8, 
the worst case cardinality of the set Sx is 1225, whereas the exponential figure of M L = 65536. 

In the next section, we consider efficient block noncoherent demodulation for the phase- 
quantized channel. This allows us to evaluate the uncoded symbol error rates. Numerical results 
for uncoded performance and channel capacity are subsequently provided in Section [VH 

V. Block Noncoherent Demodulation 

We consider the generalized likelihood ratio test (GLRT) for block noncoherent demodulation. 
This entails a joint maximum likelihood estimation of the unknown block of input symbols and 
the unknown channel phase. Specifically, given the received vector z, the GLRT estimate for x 
is given by 

x(z) = argmax max P(z|x, 0) . (5) 

xe* *e[o,27r) 

Brute force computation of the solution to §5$ has prohibitive complexity, since the cardinality 
of the input space X grows exponentially with the block length. For unquantized observations, 
it is known that the solution can rather be obtained with linear- logarithmic complexity [8J. The 
key idea used to obtain this complexity reduction works for quantized observations as well, as 
we illustrate ahead. 

First, we make some observations resulting due to the symmetry of our channel model. As 
before, we let a = j^, and q = [q • ■ ■ qL-i], where qi is the quotient obtained on dividing zi by 
a. Using Property A-4, we get 

x(z) = argmax max P(z mod a|x — q, 0) , (6) 
xeA- <^e[o,27r) 
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which in turn gives 

x(z) = x(z mod a) + q(z) , (7) 

where we have explicitly noted that q is a function of z. This result is useful in the sense that 
the solution for a received vector z can be easily obtained if the solution for z mod a is known, 
since computing q(z) is a trivial task. Hence, we restrict attention to computing the GLRT 
solution only for those z for which the vector components £ {0, 1, • • • , a — 1}. Also observe 
that P(z|x, 0) = P(z|x + 2,0 — ijj). This implies that the demodulator can not distinguish 
between two input vectors that are related by the operation of constant addition. This is well 
known (for unquantized observations), and is the basis for using techniques such as differential 
modulation. 

To obtain a low-complexity solution, the key is to interchange the order of maximization in 
©. Consider 

maxmax Pfzlx, 6) . (8) 

For a fixed 0, the inner maximization over x is straightforward since it can done in a coherent 
manner, i.e., on a symbol by symbol basis. For = 0, let the coherent solution be denoted by 
c(0) = [co(0) • • • C£_i(0)]. (We dropped the dependence on z to simplify notation). Note that 

this means q(0) = argmax P(zi\xi,(p = 0). As is increased, the coherent solution c will 

x t e{o,- ,m-i} 

change. However, this will happen only when any of the individual solutions q changes. The 
crucial observation now is that as is varied over to ||, each of the individual solutions q(0) 
changes only once. In other words, for each I, there is a crossover angle a t , such that 

q(0) = q(0) , if < < a t 

= c/(0) + 1 , if ai < < — . 

The exact crossover angles are easy to obtain as functions of zi,K,M and the locations of 
the input constellation points. Now, since we only consider those z vectors for which every 
component E {0, • • • , a — 1}, there can be at most a distinct crossover angles. Hence, when 
is varied between [0, f|), the number of distinct coherent solutions to the inner maximization in 
([8]) is at most a, and these solutions can be obtained simply by sorting the crossover angles in an 
ascending order. For each of these (at most) a input vectors, we can now numerically compute 
the metric max P(z|x, 0), and pick the one with the largest metric as the GLRT solution. 

0G[O,2vr) 
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This numerical computation can be done, for example, by fine discretization of the interval 
[0, 2n), and computing P(z|x, 0) for every in this discrete set. The number of computations 
(multiplications) required to obtain max P(z|x, </>) then scales linearly in the block length L. 

06[O,2tt) 

Note that we restricted attention to G [0, ||) only while performing the inner maximization 
in ([8]). This is because if we go on beyond |j, any new solution we get, say Ci will be related to 
one of the existing solutions, say c 2 , by the operation of constant addition, so that the noncoherent 
demodulator can not distinguish between Ci and c 2 . 

VI. Numerical Results 

We present results for QPSK with 8-sector and 12-sector phase quantization, for different 
block lengths L. We begin with the symbol error rate (SER) plots for block demodulation. Fig. 
[3] (left plot) shows the results for 8-sector quantization. Looking at the topmost curve, which 
corresponds to L = 2, we find that the performance is disastrous. As the SNR is increased, the 
SER falls off extremely slowly. A close analysis of the block demodulator reveals that the reason 
behind this is an ambiguity in the demodulator decision rule: for certain outputs z, irrespective of 
the SNR, the demodulator always returns two equally likely solutions for the input x. While we 
do not provide a complete analysis of this ambiguous behavior, an example scenario is shown 
in Fig. |4] to give insight. If the quantized output vector is z = [1 0], then we find that P(z|x, </>) 
is maximized by two equally likely pairs, (xi,0i) = ([0 0],0), and (x 2 ,02) = ([0 3],7r/4), so 
that the block demodulator, which does joint maximum likelihood estimation over the input and 
the unknown phase, becomes ambiguous. In other words, the symmetry inherent in the channel 
model, which on the one hand helped us reduce the complexity of capacity computations, is 
also making it impossible to distinguish between the effect of the unknown phase offset and the 
phase modulation on the received signal, resulting in poor performance. While we showed the 
performance plot for L = 2 only, we find that the ambiguity persists for larger block lengths also. 

Possible ways to break undesirable symmetries could be to use non-uniform phase quantiza- 
tion, or to employ dithering. Here we investigate the role of the latter. We can either dither the 
QPSK constellation points at the transmitter, or use analog pre-multipliers to dither the phase 
quantization boundaries at the receiver. We use a transmit dither scheme in which we rotate the 
QPSK constellation by an angle of \{^£) from one symbol to the next. Fig. \5\ shows this scheme 
for block length L = 2 and K = 8. The constellation used for the second symbol (shown by the 
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Fig. 3. Symbol error rate performance for QPSK with 8-sector phase quantization (left figure) and 12-sector phase quantization 
(right figure), for block lengths varying from 2 to 8. Also shown for comparison are the curves for coherent QPSK, and 
noncoherent unquantized QPSK. 




Fig. 4. Ambiguity in the block noncoherent demodulator. If the received vector is [1 0], then (X = [0 0],<f> = 0), and, 
(X = [0 3], <j> = ?) are both equally likely solutions. 



diamond shape) is dithered from the constellation used for the first symbol by an angle of ir/8. 
With this choice of transmit constellations, we find that the ambiguity in the block demodulator 
is removed, and hence the performance is expected to improve. The results in Fig. |3] (left plot) 
indeed show a significant performance improvement compared to the no-dithering case, although 
increasing the block length does not provide much gain. At SER of 10~ 3 , 8-sector quantization 
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(a) (b) 



Fig. 5. (a) Standard PSK : the same constellation (the one shown) is used for both symbols in the block, (b) Dithered-PSK : 
the constellations used for the two symbols are not identical, but the second one is a dithered version of the first one. 




SNR(E s /N o ) (dB) 

Fig. 6. Performance comparison for QPSK with block length L = 6 : plots depict the capacity of the block noncoherent channel 
without quantization, and with 8-sector quantization (with and without dithering). Also shown is the capacity for coherent QPSK. 

with L = 8 results in a loss of about 4 dB compared to unquantized observations. 

On the other hand, if we consider the performance with 12-sector quantization, it is observed 
that the block demodulator performs well, and dithering is not required. This suggests that 12- 
sector quantization does not result in any undesirable symmetries in the channel model. Fig. [3] 
(right plot) shows the performance for different block lengths. At SER of 10~ 3 , and L — 8, the 
loss compared to the unquantized observations is reduced to about 2 dB. 

Next we show the plots for channel capacity. For all our results, we normalized the mutual 
information /(X; Z) by L-l to obtain the per symbol capacity, since in practice the successive 
blocks can be overlapped by one symbol due to slow phase variation from one block to the next. 
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Fig. [6] shows the results for 8-sector quantization. (To avoid clutter, we show the results for L = 6 
only.) Also shown for reference are the capacity values for the coherent case, and for the block 
noncoherent case without any quantization. Despite the disastrous performance of the uncoded 
scheme witnessed earlier, we see that, in terms of the channel capacity, 8-sector quantization 
scheme recovers more than 80-85% of the capacity obtained with unquantized observations, 
for SNR > 2-3 dB . However, the capacity approaches 2 bits/channel use extremely slowly. 
Since H(X.) is constant, this implies that H(K\Z) falls off very slowly as SNR — > oo, which 
is consistent with the earlier observation that there is significant ambiguity in X, given Z, even 
at high SNR. The performance improvement obtained by using a dithered-QPSK input is also 
shown in Fig. [6l It is seen that the slow increase of capacity towards 2 bits/channel use has been 
eliminated. E 

While the simple transmit dither scheme considered here has improved the performance (in 
terms of both the SER, as well as channel capacity), we hasten to add that there is no optimality 
associated with it. A more detailed investigation of different dithering schemes and their potential 
gains is therefore an important topic for future research. 

In Fig. |7J we plot the capacity curves for QPSK with 12-sector quantization, for block length 
L = {2, 4, 6, 8}. Also shown for reference are the coherent and unquantized block noncoherent 
performance curves. For identical block lengths, the loss in capacity (at a fixed SNR > 2-3 dB) 
compared to the unquantized case is less than 5-10 %, while the loss in power efficiency (for 
fixed capacity) varies between 0.5-2 dB, and as before, dithering is not required. 

VII. Conclusions 

We have investigated the capacity limits and the uncoded error rates imposed by the use of 
low-precision phase quantization in a carrier-asynchronous receiver. Building up on our earlier 
results obtained under ideal channel conditions [4], the results here indicate that low-precision 
quantization could be a feasible option to overcome the ADC bottleneck in practical high- 
speed wireless system design. Two critical observations, that we make, however, are that low- 
precision quantization might lead to unexpected and ambiguous receiver operation when dealing 

2 Since the low-complexity procedure outlined in Section ITVl does not work once we dither, we used Monte Carlo simulations 
to compute the capacity with dithering. 
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Fig. 7. Performance comparison for QPSK : plots depict the capacity of the coherent channel, unquantized block noncoherent 
channel (different block lengths), and the 12-sector quantized block noncoherent channel (different block lengths). 

with unknown parameters, and, mechanisms such as dithering might be essential to attain good 
performance in the face of such ambiguities. Indeed, with low-precision ADC, dithering has 
been found to be useful while dealing with other aspects of receiver design as well : see [|22|. 
f2\\ for the crucial role played by dithering in channel estimation and automatic gain control 
with quantized receiver observations. 

There are several open issues to be addressed. Given the performance improvement obtained 
using the simple dithering scheme considered here, a more detailed investigation of different 
dithering schemes is required. Another possibility to consider, which we have not explored here, 
would be non-uniform phase quantization. While we have restricted attention to PSK inputs in 
this work, it is important to evaluate performance with QAM alphabets as well, in which case we 
need to consider amplitude quantization. Note that, amplitude quantization can, in principle help 
improve performance with PSK inputs as well, especially if the SNR is low, and the block lengths 
are small. Another topic of interest is the study and development of practical capacity approaching 
coded modulation strategies for phase quantized communication. An important practical issue 
is determining whether timing synchronization (which is assumed in the model here) can also 
be attained using phase-quantized samples, or whether some form of additional information 
(perhaps using analog techniques prior to the ADC) is required. 
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As with prior work in the literature, we assumed that the phase across the different blocks 
varies independently. While this allows analytical tractability, the continuous variation of the 
phase from one block to the next can be used to enhance performance, especially when we are 
constrained to using low-precision samples. How best to leverage this memory might be worth 
investigating. 
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Appendix A 
Computation of H(Z\x ) 

Instead of jointly accounting for constant addition and permutation, we first account for 
constant addition, and then for permutation. Specifically, we first note that using Property B-l, 
it suffices to compute P(z|x ) only for a set of vectors Sz x for which the first symbol is 0. Next, 
using the fact that P(z|x ) = P(Llz|x ), within the set Sz x , we can further restrict attention to 
a subset Sz 2 in which no vector can be obtained from another one by a permutation operation. 
Since permutations don't matter, all we are interested in is how many symbols of each type are 
picked, so that obtaining the set Sz 2 is equivalent to the well-known problem of distributing L-l 
identical balls into K distinct boxes, with empty boxes allowed. The number of ways to do this 
is C(K + L — 2,L — 1), and each of these combinations can be obtained easily using standard 
known procedures. 

Once we have the set Sz 2 , we can numerically compute the probability P(z|x ) for every vector 
in Sz 2 - The entropy i/(Z|x ) can then be obtained as follows. For z E Sz 2 , let n(z) denote the 
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number of distinct permutations that can be generated from it, while keeping the first symbol 
fixed. This is equal to J^-i » where is the number of times the symbol i occurs in z. The con- 

lli=0 T i 

ditional entropy then is H(Z\x ) = — P(z|x ) log P(z|x ) = — A'P(z|x ) log P(z|x ) = 

z S Zl 

-^An(z) P(z|x )logP(z|x ). 

Sz 2 

Appendix B 
Computation of P(Z) 

To obtain the set Sx, we divide the L locations into a groups, and permutations are allowed 
only between locations belonging to the same group. The problem then breaks down into a 
sub-problems. Specifically, let the number of locations in the groups be n ,nx, ■ ■ ■ , n a _i, then 
we need to distribute identical balls into M distinct boxes, for each i. The required number 
of combinations is the product of the individual solutions. While for large a, the reduction 
in complexity may not be huge, for small values of a (which is the paradigm of interest in 
this work), the savings will be significant. For instance, for QPSK with L = 8, and a = 2, 
the worst case (which happens when n = n{) number of combinations is 1225, compared 
to the exponential figure of M L = 65536. Once the set Sx has been obtained, we can get 

p ( z ) = jjt Ex G s x ?(«)P(z|x). Here, q(x) = UZo n^W*))! ' where n ^ is the number 
of times the input symbol j occurs in the locations belonging to group i, for the vector x. 
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